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Beyond complete genomes: from sequence to structure and 
function. 



Koonin FA , Tatus(»\ RL, dalpcrin \I\ 

Naiiona! (^^Vnlor tor Bioicchnoloi^y Intbrniation. National l.ibrar}- of 
Medicine. National Inslilules of Health. Bethesda. Man land 20894. USA. 
k(u>nin t7 nchi.nl maiih.gox 

Computer anaK sis of complete prokarj otic genomes show s that microbial 
prcneins are in geneml highl}' consen ed--appro\imaiely 70% oflhem 
contain ancient ecMiserxed regions, This allows us to delineate lajmlies o[ 
orlhologs across a wide ph> logenetic range and. in many cases, predict 
protein functions with considerable precision. Sequence database searches 
using new.l\ de\e!-)ped. sensiti^ e algorithnns result in the unification ofsueh 
orthologous families into largei' superlamilies sharing common sequence 
molif>. For many (^t" these superfamilies. prediction of the siruciural told and 
specific amino acid residues in\ ol\ ed in enz) matic eatal\ sis is possible. 
Taken together, sequence and structure comparisons pro\ ide a powerful 
methodology that can successfulK complement iraditiv)nal experimental 
approaches. 
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